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AN ASYMPTOTIC TOTAL VARIATION TEST FOR COPULAS 

JEAN-DAVID FERMANIAN, DRAGAN RADULOVIC, AND MARTEN H. WEGKAMP 

Abstract. We propose a new goodness-of-fit test for copulas, based on empirical 
copula processes and their nonparametric bootstrap counterparts. The standard 
Kolmogorov-Smirnov type test for copulas that takes the supremum of the empir- 
ical copula process indexed by half spaces is extended by test statistics based on 



p^ the supremum of the empirical copula process indexed by families of L„ disjoint 



rectangles, with L„ slowly tending to infinity. Although the underlying empirical 
Q process does not converge, it is proved that the p- values of our new test statistic can 

^7" be consistently estimated by the bootstrap. Simulations confirm that the power 

of the new procedure is often higher than the power of the standard Kolmogorov- 
t^ Smirnov or the Camer-von Mises tests for copulas. 
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1. Introduction 



^ This paper introduces a new powerful goodness-of-fit (GOF) test for bivariate copulas 

^ in [0, 1]^ based on the empirical copula process 



- (1) Zniu,v) = V^{Cn-C){u,v), 0<U,V<1, 

/\ given a sample of n independent observations (Xj, YiJq Here C„ = EI„(F^, G^) is the 

^ empirical copula, introduced by Deheuvels (1979), EI„ is the joint cdf of the sample 

(Xi, Yi), . . . , (X„, Yn) and F~ and G~ denote the marginal quantile functions of the 

sample Xi, . . . , X„ and Yi, . . . , Yn, respectively. The Kolmogorov-Smirnov (KS) test 
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statistic for testing of the null hypothesis Hq : C = Co is 



(2) KS„ = sup \^/n{Cn- Co){u,v)\. 

0<u,v<l 

The Cramer-von Mises statistic (CvM) is 

(3) CM„ = /"{v^(C„ - Co){u,v)ydCn{u,v). 

It is well-known, see, for instance, Fermanian et al. (2004), that Z„ and its boot- 



strap counterpart Z*, defined in (14) below, both converge weakly to the same tight 
Gaussian process in £°°([0, 1]^) under the null hypothesis. Therefore, we can com- 
pute the a-upper points of KS^ and CM„ via the bootstrap. To the best of our 
knowledge, all the proposed GOF tests rely on simulation-based procedures to cal- 
culate their corresponding p-valueqj A parametric bootstrap has been proposed 
(Genest and Remillard, 2008) to tackle composite null hypotheses, while Remillard 
and Scaillet (2009) advocate the use of the multiplier central limit theorem to build 
an alternative bootstrap empirical copula process. Biicher and Dette (2010) gives a 
survey and a comparison of various bootstrap methods. 

The goal of this paper is to develop a more powerful test than the KS test ^ 
and CvM test (tsl). We propose the following test that rejects the null hypothesis 
Ho : C = Co for large values of the test statistic 

(4) T, := sup 5^|Z„(5fc)|. 

Bi,-,Bl„ fc=l 

The supremum is taken over all disjoint rectangles Bi, . . . , Bl^ C [0, 1]^ of the form 
(a, 6] X {c,d], using the convention 

(5) Z„((a, b] X (c, d]) = Z„(6, d) - Z„(a, d) - Z„(6, c) + Z„(a, c), 
for all < a < 6 < 1 and < c < ci < 1. 



with the notable exception of the distribution-free test statistics of Fermanian (2004). This idea 
has been further developed by Scaillet (2007) and Fermanian and Wegkamp (2012). 
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Now, ii Ln = L for all n, the collection of rectangles is sufficiently small that we 
can still appeal to the weak convergence of Z„ and Z* in conjunction with the con- 
tinuous mapping theorem, to obtain a-upper points of the test statistic T„ via the 
bootstrap. Taking L„ = +00 for all n, or equivalently, if we consider all families of 
disjoint rectangles in [0, 1]^ (possibly partitions), the statistic T„ is equal to the total 
variation distance TV(Z„) of Z„. The resulting test isn't statistically meaningful as 
TV(Z„) is maximal, to wit, Ty(Z„) = n^/^ — )■ +00. The problem is to find a rich 
collection that quickly detects departure from the null, but still yields a consistent 
test. The main novelty of our approach is the fact that we let L„, the number of 
rectangles, slowly tend to 00 in that L„ ~ (logn)''', < 7 < 1. While in this case 
the process Z„ no longer converges. Theorem [T] in Section 2 states that we can still 
consistently estimate the distribution of the process Z„ by the bootstrap. We refer 
to our procedure as the Asymptotic Total Variation (ATV) test. The considered 
families of rectangles are finer and finer, presumably improving the power of the 
test, while for each n large enough, we still have a consistent test in that we control 
the type 1 error. A key observation is that under the null hypothesis Hq : C = Co, 
we have T„ < L^sup^ |Z„(i?)| = Op{Ln), while under the alternative B.a : C = Ci 
for some fixed Ci 7^ Cq, T„ is much larger since the bias is at least of order 0(n^/^). 

Theorem IT] extends the surprising result obtained by Radulovic (2012) for empirical 



processes indexed by sums of indicator functions of VC-graph classes (Theorem 12 
in the appendix). We require very mild conditions on the copula function C. This is 
one of the few notable exceptions known to us in the literature where the bootstrap 
"works", that is, the conditional bootstrap distribution consistently estimates the 
distribution of the test statistic, while the distribution of the statistic itself does not 
converge. For other instances of this phenomenon, we refer to Bickel and Freedman 
(1983) or Radulovic (1998, 2012), more recently. 

Section 3 considers the more general hypothesis that the underlying copula C belongs 
to some parametric copula family {Cg, ^ G C W}. Given a sufficiently regular 
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estimator 9 and its bootstrap counterpart 6**, we adjust our statistic (|4|) and its non- 
parametric bootstrap counterpart to obtain a consistent level a test (Theorem [s]). 
Again, the result is established under very mild regularity conditions on the copula 
Ce and the estimators 9 and 9* . 

Section 4 then reports a small numerical study where we show that, in complex 
but realistic situations, our test Q is superior to the Kolmogorov-Smirnov and the 
Cramer-von Mises tests. We also comment on a possible inadequacy in the way the 
copula GOF tests are commonly evaluated. 

Finally, the proofs are collected in Section [5j The appendix contains some technical 
results from Segers (2012) and Radulovic (2012). 



2. Generalized Kolmogorov-Smirnov test 

Notation. Let H be the distribution function of the random variables (X, Y) with 
marginals F and G. We will assume throughout the paper that iJ, F and G are 
continuous. Let (Xi, Fi), ■ ■ ■ , (X„, F„) be independent copies of (X, Y). We denote 
the generalized inverses of F and G by F~ and G~, respectively. For instance, 

(6) F-{u)=mi{x\F{x)>u}. 

The empirical counterparts of H, F and G are, respectively, 

1 " 

e„(x,y) = -Vi{Xi<x,Fi<y}, 

n ^-^ 

1 " 

(7) F„(a:) = -Vl{X, <x}, 

n ^-^ 

1 " 
Gn(z/) = -$^l{F.<y}, 



n 

i=l 



with — oo < x,y < oo. We denote the copula function of (X, Y) by 
(8) G{u,v) = H{F-{u),G-{v)), 0<u,v<l 
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and its empirical estimate by 

(9) Cn{u,v) = e„(F;(M),G;(t;)), 0<u,v<l. 

The empirical copula process Z„('u, t>) = ^/n{C■n — C){u,v) is already defined in ([l]). 
We define J-'n as the class of functions 

(10) f{x,y) = Y,Ckl{{x,y)eBk}, 

k=l 

with Cfc G { — 1,+1} and disjoint rectangles B^ of the form (a, 6] x (c, d] in the unit 
square [0, 1]^, for all 1 < fc < L„. We let 

k=l 

and observe that 

T„ = sup |Z„(/)| = sup ^|Z„(5fe)|, 

where the supremum is taken over all disjoint rectangles Bi, . . . ,Bl^ of the unit 
square [0, 1]^. 

If L„ = L for all n, then J-'„ = J-" and Z„ converges in i°°{J^) to a Gaussian process 
under regularity conditions on C, see, for instance, Fermanian et al. (2004) and Segers 
(2012). As a consequence of the continuous mapping theorem, T„ trivially converges 
weakly as well. However, if L„ — )■ oo, as n — )■ oo, this is no longer true as the process 
Z„ does not converge weakly. 

The main point of this paper is to show that, provided L„ = (logn)"^ for some 
< 7 < 1, the distribution of T„ can be estimated by the bootstrap. The bootstrap 
counterparts of the above processes are defined as follows. Let the bootstrap sample 
{X^,Y*), ■ ■ ■ , (X*, Y*) be obtained by sampling with replacement from (Xi, Yi),- ■ ■ , 
{Xn,Yn)- We write 

(11) H;(x,i/) = -Y,HX* < x,Y* < y} 
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for the empirical cdf based on the bootstrap, with marginals 

(12) Ki^) = -i2^{x:<x} 



n . 
1=1 



and 



1 " 

(13) '^l{y) = -Y.^{y;<y} 



n 



and we denote its associated empirical copula function by C* . The bootstrap empir- 
ical copula process is 



(14) z: = v^ici - c„) = v^{e:(Fr,Gr) - e„(F;,Gj} . 

Assumptions. We will assume the following set of assumptions: 

(CI) The first-order partial derivatives Ci{u,v) = dC{u,v)/du and C2{u,v) = 
dC{u,v)/dv of C{u,v) exist, are of bounded variation, and satisfy, for some 
r > 0, /3 > and K < oo, 

|Ci(m, v) - Ci(s, t)| < K {u-\l - n)-^ + s-\l - sY^) {\u - s^ + \v- t^) 

and 

|C2(«, v) - C2(s, t)\<K {v-^{l - v)-^ + t-^(l - t)-P) {\u - sY + \v- tY) 

for all < u, f, s, t < 1. 
(C2) The number L„ is of order (logn)'*' for some < 7 < 1. 

Remark. We know that continuity of the partial derivatives of C{u,v) on (0, 1)^ is 
required for weak convergence, see Fermanian et al. (2004) and Segers (2012). The 
requirement that the partial derivatives are of bounded variation is natural since we 
compute the supremum of Z„ over increasingly finer families of rectangles in [0, 1]^, 
and the process Z„(m, v) is asymptotically equivalent to «„(«, v) — Ciiu, f )«„(«, 1) — 
C2(u, f)Q;„(l, f) with an{u,v) = ^/n{M.n{u,v) — H{u,v)), see Proposition [o] The ad- 
ditional requirement is weaker than imposing a Holder condition on the derivatives. 
Segers (2012) imposes a slightly stronger condition on the second partial derivatives 
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of C (corresponding to r = 1) to obtain an almost sure representation of the empir- 
ical copula process. The second assumption (C2) allows for sub-logarithmic rate in 
the sample size for the number of rectangles considered. In practice, even this fairly 
slow rate yields much better tests, see our simulations in Section 4. 



Our first result states that the processes Z„ and Z* are close in the bounded Lipschitz 
distance (Iblx that metrizes weak convergence. Formally, we define 



(15) 



dBL, (z„, K) = sup |E[/i(z„)] - E* [/i(z:; 

h 



Here E* is the conditional expectation with respect to the bootstrap sample. The 



supremum in (15) is taken over the class BLi := BLi{£^{J-'n)) of all functionals 

h: i°°{J^n) ^M with 

(16) 

and 

(17) 

ioT all x,y ei°^iJ^ n)- 



sup \h{x)\ < 1 



\h{x)-h{y)\< sup \x{f)-y{f)\ 



Theorem 1. Let Z„ = {Z„(/), / g J'n} and Z; = {Z^f), f e J'n} with 7^ as 



defined in (10) above. Under conditions (CI) and (C2), we have 
(18) dBLA'^n.K)= sup |E[/i(Z„)]-E*[/i(Z:)]|^0, 

in probability, as n -^ oo. 
Since c?blj(Z„,Z*) < 2, an equivalent statement is that 

\imE[dBLA^n,K)] =0. 
n— >oo 

Since the mapping h{X) = sup^^jr |X(/)| belongs to BLi, Theorem fll implies that, 
for any Lipschitz function g, 



(19) 



E 



g I sup |Z„(/)| 



E* 



^1 sup |z:(/)i 



^0, 
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in probability, as n — )• oo. The idea is to approximate the distribution of the statistic 
T„ by the conditional (bootstrap) distribution of 

(20) K= sup |Z:(/)|= sup 5^|Z:(5,)|. 

Corollary 2. Consider the null hypothesis Hq : C = Cq, for some copula function 
Cq satisfying (CI). Then the test statistic 



(21) Tn = v^ sup |C„(/)-Co(/)| 



with Tn O'S described in (10) above, and with Ln satisfying (C2), can be consistently 



estimated by T*, under the null hypothesis Hq : C = Cq. That is, 
(22) sup |E[/i(TO]-E*[/i(X:)]|^0, 

hi^BLi 

in probability, as n —)■ oo. 

Remark. Note that, in particular, these results continue to apply if we replace J-'„ 
by a subset J^„ C J-'„. For instance, we could restrict ourselves on regular partitions 
(based on grids in [0, 1]^). This may be important to calculate T„ in practice. 

Remark. The Corollary states that (iBi^(T„,T*) — > in probability, as n — )■ cxd. 
For our test, we need the a-upper point of the statistic T„, that we approximate by 
the bootstrap T* . The bootstrap quantile approximation is justified by the following 
simple observation. Let e > be arbitrary and define the Lipschitz function 

gt^^{x) = l{x <t}-\ l{t <x <t + e}. 

Wehave, for5„:=rfBLi(T„,T;), 

P{X„<t} < E[gt,e{Tn)] 

= r [^,,(t;)] + E [gtA^n)] - E* [gtAK)] 
< F*{Tl<t + e} + 6Je, 

since gt^e has Lipschitz constant 1/e. A similar computation shows that 

P* {T; < t - e} - 6n/e < P {T„, < t} 
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SO that, uniformly in t, 

(23) P* {T*^<t-e}- 6n/e < P{T„ < t} < P* {T^ < t + e} + Kje 
Since T* is a discrete random variable with finitely many atoms, we have 

p* {t; < t - £} = p* {t; < t + e\ 

for almost all t G M, and for e > small enough. This equality can be verified in 
practice for any given t. Then, for such a t, since 5„ — )■ in probability, as n — > oo, 
we may conclude that 

p {T„, < t} ^ p* {t; < t} 

in probability, for n large enough. This justifies the bootstrap quantile approxima- 
tion. 

Remark. This test is consistent. Indeed, under the null, observe that 

T„ < L„sup |Z„(5)| = Op{Ln). 

B 

Although this diverges, we can always estimate the distribution of T„ for n large 
enough. Under the alternative hypothesis, H^ : C = Ci for a fixed Ci ^ Cq, we 
have that 

T„ > v^|Co(5) - C^{B)\ - |Z„(5)| = Opiv'l'') 

for any rectangle B where Cq and C\ differ. Such a rectangle exists under the al- 
ternative and the increasing sequence Tn likely contains at least one such rectangle 
for relatively small n. The improved power of our test statistic is confirmed in our 
simulation study. 

Remark. Taking the supremum over all families of size L„ makes the computation 
of the statistic T„ non-trivial. If we do not take the supremum over all disjoint 
rectangles Bi, . . . , B^^ in Q, but rather limit ourselves to a particular sequence 
of families n„ with Bni, ■ ■ ■ ,BnLn ^ n„, then L„ is allowed to grow at a faster 
power rate n'', ( < 1/4 in the sample size n. Again, although the resulting statistic 
§„, = J2k=i \^n{.Bnk)\ does not have a weak limit for L„ — )■ oo, its distribution can be 
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consistently estimated by a bootstrap estimator. This test is less powerful than the 
ATV test, but S„ is easier to compute than T„. We will report on this test elsewhere. 

3. Parametric hypothesis 

In this section we consider the problem of testing if the underlying copula C be- 
longs to a parametric family C := {Cq, 6 E Q G W} . That is, the null hypothesis 
states that C = Ce^ for some 6*0 G 0. Suppose that we have a consistent estimator 

e = e{mn) of^o- 

Replacing Cq by Cg in the definition of the test statistic T„, we consider the process 

(24) Y„ = v^(C„ - C^) = Z„ - V^{C^ - C), 

and its bootstrap version 

(25) y; = z:-v^(c^.-c^). 

based on the non-parametric bootstrap estimate 6* = ^(H*), obtained after resam- 
pling with replacement from the original sample. Note that 

(26) y; = v^(c: - a. ) - v^(c„ - c.) ^ v^(c: - c.. ) . 



Indeed, the process ^/n{C^ — Cg-*), while perhaps a natural candidate, does not yield 
a consistent estimate of the distribution of Z„. 

We stress that our approach does not involve the parametric bootstrap, as studied by 
Genest and Remillard (2008) to estimate the limiting law of copula-based statistics. 
In other words, we calculate 6* after resampling from the empirical distribution EI„, 
and not from the law given by the parametric copula C^. 

We impose some regularity on our parameter estimate 6. 
(C3) There exists a-^-.R'^ ^^W with / \\ip\\^ dH < oo such that 
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and 

r-^=y"^rf(e;-e„) + 4, 

under the null hypothesis, with ||£:„|| = Op(n^^^'^ / Ln) and ||£*|| = Op*{n~^^'^/Ln) 
in probability. 

Note that the estimators satisfying (C3) are closely related to the estimators in the 
class TZ of regular estimators, as defined by Genest and Remillard (2008). 

Example (Estimators based on the inversion of Kendall's tau). As an example, 
we verify condition (C3) for estimators based on the inversion of Kendall's tau. Let 
9 = g{T) for some twice differentiable function g and Kendall's r := 4E[C0([/, V)] — 1, 
with the expectation taken over {U, V) ~ Cg. Kendall's r is estimated empirically 
by 

A n n 

Tn ■■=^^r—^^Y. E im-Yi){X,-X,) > 0} - 1. 
Then f/„ := r„ + 1 is a U-statistics of order 2 for the kernel 

hiixi,yi);{x2,y2)) = 2- l{(t/2 - yi)ix2 - Xi) > 0}. 

The projection of f/„—E[f/„] onto the space of all statistics of the form Y17=i dii^ii ^)) 
for arbitrary measurable functions Qi with E[5fj(X, F)] < oo, is 

n r) ™ 

i=l ^ i=l 

with 

/ii(x, y) = {F\x, y) + F^{x, y) - E[F'(X, Y) + F^{X, Y)] ] 

and F\x,y) = P{X < x,Y < y) and F'^{x,y) = P{X > x,Y > y). By Hajek's 
projection principle, 

Var(f/„ - f/„) = Var(f/„) - Var(f/„). 

From the proof of Theorem 12.3 in Van der Vaart (1998), due to Hoeffding (1948), 
the difference is of order 0{l/n^). Consequently, f/„ — E[f/„] = Un + Rn with i?„ = 
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Op{l/n) SO that 



n 



Tn-r = Un- nUr] = ^„ + i?„ = - ^ /ii(X,, Y,) + Op{l/n). 

'1=1 

Hence, if g is twice continuously differentiable in the neighborhood of r, a hmited 
expansion ensures that 9 := 5'(t„) satisfies the first part of (C3). The second (boot- 
strap) part of (C3) follows from the same reasoning. 

Moreover, we need more regularity concerning 9 v-^ Cq itself. 

(C4) For every (s,t) G [0,1]^, the function 9 i— t- Cg(s,t) has continuous partial 
derivatives Cg{s,t) = {d/d9)Ce{s,t) that satisfy a Holder condition with 
Holder exponent z/ > locally: there exists a constant K < oo such that 

^nv\\Ce{s,t)-Ce,{sM < K\\9 - 9or , 

s,t 

for every ^ in a neighborhood of ^o- Moreover, Cgg is of bounded variation. 

The regularity condition (C4) is satisfied for most of standard copula families. Simple 
calculations show that it is the case for the Gaussian-, Clayton- and the Frank-copula 
families in particular. Although copula derivatives dCg{x,y)/dx and dCe{x,y)dy 
with respect to their arguments often exhibit discontinuities or non-existence near 
their boundarie^ the derivatives dCg{x, y)/d9 with respect to the copula parameter 
9 behave a lot more regularly. 

Theorem 3. Let Y„ = {¥„(/), / g J^n} and Y; = {¥;(/), / g J^„} wzth 7^ 



in (10) as defined above. Assume that conditions (CI), (C2), (C3) and (C4) hold. 



Then, under the null hypothesis Hq : C = Cg, 9 E Q, we have 

(27) limE[rfBL,(Y„,Y;)] = 0. 

n— >oo 

This result implies that the distribution of the test statistic 

(28) f, = sup |Y„(/)|= sup Y.\Yn{Bk)\ 



/eJ'n Bi,--,Bl, 



k=l 



justifying conditions such as (CI), see Segers (2012). 
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can be "bootstrapped" by the distribution of 

(29) f:= sup |Y:(/)|= sup Y.\^n{Bk)\. 

Corollary 4. Assume that conditions (CI), (C2), (C3) and (C4) hold. Then, 
under the null hypothesis Hq : C = Cg, 6 E Q, 



(30) limE dBLA^n,K 



n—>-oo 



0. 



Often, (C3) can be replaced by 
(C3') There exists a ■^ : R"^ ^ W with / ||V'f dC < cx) such that 

1 " 
6-00 = - V{7/>(F„(X,), G„(r,)) - E[^{F{X,), G(r,))]} + 5„, 



n . 



and 



^ ^ 1 " 

^* - ^ = ;; 5^{^(f;(x;), G:(r;)) - ^(f^x,), G„(ro)} + 4: 



n 

i=l 



under the null hypothesis, with ||£:„|| = Op{n ^^"^ /Ln) and ||£*|| = Op»{n ^^'^ /Ln) 
in probability. 

This is a consequence of the following result. 

Proposition 5. Assume (CI) holds. Any estimator 6 satisfying (C3'), satisfies 
(C3). 

Copula parameters are typically estimated through pseudo-observations or ranks, 
without any assumption on the marginal distributions. For this reason the cop- 
ula estimators that satisfy (C3') are relevant. They are very closely related to the 
estimators in the class TZi of Cenest and Remillard (2008). In particular, the maxi- 
mum pseudo-likelihood estimator, that maximizes the pseudo log-likelihood function 
JlogCfiirfCn over ^ G 0, see, for instance, Cenest et al. (1995) or Shih and Louis 
(1995), satisfies (C3') under suitable regularity conditions on the copula density 
ce{u,v). 
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Since the bootstrapped copula process Y* is new, it is noteworthy to stress that it 
provides an valuable alternative to the usual parametric bootstrap. Now, assume 
L„ = L is a constant, to retrieve the standard framework. 

Corollary 6. Assume that conditions (CI), (C3) and (C4) hold. Then, the pro- 
cess {Y„(s,t),0 < s,t < 1} tends weakly towards a Gaussian process in £°^([0, 1]^). 
Moreover, the bootstrapped process {Y*(s,t),0 < s,t < 1} converges weakly to the 
same Gaussian process in probability in £°^([0, 1]^). 

4. Applications and Numerical Studies 

We present a limited numerical study which should serve as a proof of principle 
rather than the final word on this subject. The evaluation of GOF tests in copula 
settings is a complex problem and only partial answers can be found in literature: 
see the surveys of Berg (2009), Genest et al. (2009) and, more recently, Fermanian 
(2012). A full-scale numerical analysis is beyond the scope of this paper. 

4.1. Heuristics. For two copula densities Cq and Ci, we define the difference sets 
A^ and A^ as 

A'^ = {{s,t) : Co(s,t) > Ci(s, t)}, and A^ = {{s,t) : Co(s,t) < Ci(s,t)}. 

We assume that there exist minimal integers M+ and M~ such that 

A+ = f/^;D+ + A+ and A- = U^Dr + A^ , 

for Lebesgue null sets Aq and Aq and open sets Df and Df in [0, 1]^. The proposed 
statistic T„ is designed to sample L„ rectangles in order to maximize the difference 
between the "true" and postulated copulas. In situations where the geometry of the 
difference sets A+ and A^ is complex (most often when M+ and M~ are large), 
T„ can "pick out" the disjoint regions Df and D^ ., and one could expect superior 
performances consequently. Moreover, since we typically do not know the values of 
M+ and M~, it makes sense to let L„ — > oo, which is exactly what our theoretical 
results allow us to do. 

However, in situations where M+ and M^ are small or one of the regions Di is much 
larger than the others, it is not advantageous to sample numerous disjoint rectangles. 
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In these situations, just a single well placed rectangle can pick essentially all the mass 
of sets A~^ or A~ , while the remaining L„ — 1 rectangles are just collecting noise and 
consequently diminish the power of the statistic T^. 

Most common scenarios encountered in the literature compare Frank, Clayton, Gum- 
bel, and Gauss copulas with each other, after controlling for some dependence indi- 
cator (typically Kendall's tau): see, for instance. Berg (2009), Genest and Remilliard 
(2008) and Genest et al. (2009). However, all these pairings produce trivial difference 
sets y4+ and A~ , as revealed in the contour plots and 3D plots of cq — ci of Figure 
1. We see that nearly all the mass difference between copula densities Cq and Ci is 
concentrated in a single spot, located in either the lower left or upper right corner Q 
Therefore, these common simulation scenarios are tailored towards many standard 
GOF tests such as KS and CvM tests. 

We are not aware of any argument that justifies such specific types of pairing, except 
for analytical tractability. Figures |2] and |3| however, paint a very different scenario 
with more elaborate difference sets A~^ and A~ that appears in real life situations. 
How often and to what extent this complex situation is encountered is still an open 
issue. 

In this study, the copula densities ci were estimated by kernel density estimators 
based on the following data: 

• The bivariate ARCH process (Xi, Yi), . . . , (X„, y„), with n = 10^, was gen- 
erated as follows: First, we created independent Zi ~ A^(0, 1) and Wi = 
Zi{l + O.GWl.y/^, with Wo = 0. Second, we set (X„r,) := {W^oi, W^oi+i), 
creating nearly independent observations. 

• The Mixture Copula data (Xi, Yi), . . . , (X„, Yn), with n = 10^, are generated 
from the mixture co{s,t) = ^CF{s,t) + ^cf{1 — s,t) for the Frank copula cp 
with Kendall's r = 0.4. 

• The Euro-Dollar data (Xi, Yi), . . . , (X„, F„), with n = 1800, are quoted cur- 
rency exchange values. X is the daily percentage change of the Euro against 



Here Kendall's t — 0.4, but we observed similar plots for different values of t. 
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Figure 1. Contour and 3D plots of common comparisons: Clayton- Frank 
(left panel); Gumbel- Frank (middle panel); Clayton-Gumbel (right panel). 



the US dollar, while Y corresponds to the daily change of the Canadian dollar 
against the US dollar. 
• The Silver-Gold data (Xi, Yi), . . . , (X„, F„), with n = 5000, presents the log 
ratio of the average daily price of silver and gold futures respectively. For 
instance, Xj = \og{Si+i/Si) based on the average price Si of silver in US 
dollars on day i. 

In the case of Mixture copula and ARCH, we used co{s,t) = 1 (the independence 
copula). In the case of Euro-Dollar and Silver- Gold, we took the Frank copula den- 
sity with parameters r = 2.6 and r = 3.4, respectively, for cq. The latter parameters 
were chosen after minimizing the (estimated) Li-distance between cq and Ci. The 
difference sets are easily depicted by dark and bright sections of the contour plots 
and the 3D plots clearly indicate that the mass difference between copula densities 
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Co and ci is not concentrated in a single spot. 
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0.2 0,-1 0.6 





Figure 2. Contour and 3D plots of complex comparisons based on syn- 
thetic data: ARCH (left panel); Mixture Copulas (right panel). 



4.2. GOF tests in practice. We generated the data sets ARCH and Mixture Cop- 
ula as described above. For each data set, we run two sets of simulations: 

• (ARCH-S and Mixture-S) Test the simple null hypothesis Co{s,t) = st using 
the methodology of Section 2. 

• (ARCH-C and Mixture-C) Test the composite null hypothesis that Co is a 
Frank copula using the procedure described in Section 3. 

In both cases, the null hypothesis is wrong and should be rejected. 
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Figure 3. Contour and 3D plots of complex comparisons based on real 
data: Euro-Dollar (left panel); Silver-Gold (right panel). 



In our simulations, the statistics T„ and T^ are computed using the number of rect- 
angles Ln = [ln(ra)J — 2. In our experience [In(n)J — 3 or [ln(n)J — 1 worked equally 
well. However, we did encounter some performance issues with larger choices of L„, 
for instance L„ = n^^'^ or L„ = n^^"^. We approximated the p-values of our statistics 
via the bootstrap procedures introduced in sections 2 and 3. For each approximation, 
we used 1,000 bootstrap samples. For the second set of simulations (ARCH-C and 
Mixture-C), we computed the parameters 6 and 6* by the usual pseudo-maximum 
likelihood procedure. Each procedure is repeated 100 times. We report the percent- 
age of times that the computed p-value is below a = 0.05. 



Although the computation of ATV statistic consists merely of a complex optimization 
problem, we found that the following brute force approach yields good results: 
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(1) sample random disjoint rectangle Bi, ..., Bl„; 

(2) compute the quantity A = Y.ti l^n(5,)| or Y.ti \^n{Bi)\- 

(3) repeat this procedure 3000 ■ L^ times and record the largest quantity A. 

We implemented two simple computational tricks to assist the above scheme in order 
to speed up the procedure. 

• Since randomly chosen rectangles are often overlapping, set a := 1/2 and 
choose rectangle Bi at random such that area(i?i) < a. Then, set a : = 
a — area(Si)/2. Given i?i, ..., Bk rectangles, sample a rectangle Bk+i such that 
area(-Bjfc+i) < a and -Bjt+i ni?j = for j = 1, . . . , k. Set a := a — area(-Bfc+i)/2 
and repeat the procedure until k = L^- 

• Pre-compute and store the matrix 'Cn{i/n,j /n), 1 < i, j < n. We can simply 
recall the appropriate C„(z/n,j'/n) each time we need to compute C„(s,f'''^ 



The above computations with 1000 bootstrap samples took 10 — 15 seconds on a 
standard PC. More sophisticated optimization methods, like the Accelerated Ran- 
dom Search (Appel et al., 2004), yield much faster computation (less than 1 second). 
Our limited numerical study confirms the above assessment. Table 1 shows that the 
ATV test outperforms the KS and CvM tests in the case of complex pairing, while 
Table 2 confirms that the ATV test is inferior in case of the commonly used pairings 
of Figure 1. In Table 2, for each pair of copulas, say Clayton - Frank, we generated 
n observations from the first copula (Clayton), and we tested the null hypothesis 
that the second copula (Frank) is the true underlying copula. Table 3 shows that 
the significance level of the ATV test is below 0.05. The data was simulated from 
the null hypothesis. In all tables, Kendall's r = 0.4. 



Indeed, naive computation of C„ requires n operations. For our optimization scheme, this would 
lead to 3000 ■ n ■ Ln operations. In comparison, using recursion, the grid can be computed with n^ 
operations and only once, resulting with the total of n^ + 3000L„ operations. 
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type 


n 


ARCH-S 


ARCH-C 


Rotated-S 


Rotated -C 


ATV 


400 


75% 


80% 


41% 


25% 


KS 


400 


6% 


4% 


8% 


12% 


CvM 


400 


25% 


50% 


6% 


15% 


ATV 


800 


100% 


99% 


94% 


98% 


KS 


800 


32% 


50% 


20% 


25% 


CvM 


800 


50% 


92% 


31% 


84% 



Table 1. (Complex pairing related to Figure 2) Relative frequencies 
of rejected null hypotheses under a = 0.05. 



type n Clayton - Frank Gumbel - Frank Clayton - Gumbel 



ATV 400 


42% 


26% 


88% 


KS 400 


58% 


25% 


90% 


CvM 400 


84% 


47% 


95% 


ATV 800 


92% 


58% 


94% 


KS 800 


98% 


53% 


98% 


CvM 800 


100% 


73% 


100% 



Table 2. (Trivial pairing related to Figure 1) Relative frequencies of 
rejected null hypotheses under a = 0.05. 



type n Clayton - Clayton Gumbel - Gumbel Frank -Frank 

ATV 400 3% 2% 2% 

KS 400 4% 5% 4% 

CvM 400 4% 5% 4% 

ATV 800 2% 4% 3% 

KS 800 3% 3% 5% 

CvM 800 5% 3% 6% 

Table 3. (Errors of the first kind) Relative frequencies of rejected 
null hypotheses under a = 0.05. 



5. Proofs 

Throughout the proofs, we assume without loss of generality that F = G = I 
(uniform marginal distributions). This implies that H = C. This is justified by the 
following lemma. 
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Lemma 7. Let F, G be continuous distribution functions. Denote by H the cdf of 
{F{X),G{Y)) and by C its associated copula. The empirical copula associated to the 
sample (F(Xi), G(Fi)), . . . , (F(X„), G(F„)) is denoted by C„. We have 

C{x, y) = C{x, y) = H{x, y) for all x,y e [0, 1]. 

Moreover, 

C„ ( -,- ) = C„ ( -,- ) forij = 0,l,...,n. 

Proof. See Lemma 1 in Fermanian et al. (2004). D 

Since the letter C is reserved for the copula function, we use the letters K,Ko,Ki, 
etc. in the sequel to denote generic constants. 



5.1. Proof of preliminary results. In general, note that, for each f e Tn 



defined in (10), we can write 



fc=l j=l 

and 

i=i 
for some aj E { — 1, +1} and < Sj,tj < 1 using formula ([S]). Let 

an = v^{Mn - H) 

be the ordinary empirical process in [0, 1]^, and let its oscillation modulus be defined 

as 
(31) 
Mn{5) :=sup{|a„(s,t) -a„(s',t')l : \s - s\ < 5, |t - t'| < (5, < s,s',t,t' < 1}, 

for any 5 > Q. 



22 FERMANIAN, RADULOVIC, AND WEGKAMP 

Lemma 8. Let ((5„)„>o be a sequence of positive real numbers such that n6n/iogn — )■ 
oo. Then we have 

Mn{6n) = 0((5y^(logn)^/^) almost surely. 



Proof. We apply Proposition 13 with A„ = K^bJ, (logra)^/^ for some constant Kq > 0. 
Since n~^^'^\n/5n = Kq {\ogn/{n5n)) tends to zero, this inequahty can be rewritten 

P{M„(5„) > A4 < ^exp (^2:^t!^\ = K,ne^^ {-K,Kl^{l)\ogn) , 

for some constants Ki, K2 and n sufficiently large. When Kq is sufficiently large, we 
check that 

P{M„(5„)>A„}<^, 
for some constant K^. Invoke the Borel-Cantelli Lemma to conclude the proof. D 

In addition, set 

(32) I'n{s,t) = a„(s,t) -Ci{s,t)an{s,l) - (^2(5, t)a„(l, t), 

and define 

^n(/) = y^q-fc^ra(gfc,4)- 
fc=l 

Proposition 9. Under conditions (CI) and (C2), we have 

lim (iBL^(Z„,Z„) = 0. 

Proof. First, we observe that 

(iBLj(Z„,Z„) = sup ¥.[h{Zn) - hiZn)] 

heBLiie^iTn)) 

< 5 + 2pJ sup |Z„(/)-Z„(/)|>(5 

The latter inequality holds for any 6 > 0, and uses the fact that \h\ is bounded by 1 
and has Lipschitz constant 1. It remains to show that 

sup |Z„(/)-Z„(/)|^0, 
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in probability, as n — ?■ oo. Second, we note that 

sup |Z„(/) -Z„(/)| < 4L„ sup |Z„(s,t) -Z„(s,t)|. 

The remainder of the proof generahzes Proposition 4.2 of Segers (2012). Now, 

sup |Z„(s, t) — Z„(s, t)| < I + II 

0<s,t<l 



with 



and 



sup |a„(F„s,G„t) -a„(s,t)| 

0<s,t<l 



// = sup |v^[C(F;s,G;t)-C(s,t)] +Ci(s,t)a„(s,l) + C2(s,t)a„(l,t)|. 

0<s,t<l 

The first term, J, can be bounded as follows. Set /3„i(s) = y/n(¥^s — s) and Pn2if) = 
^/n{G~t — t). By the Chung-Smirnov LIL, we have 

sup |/3„i(s)| + |/3„2(t)| = 0((loglogn)^/2) almost surely. 

0<s,t<l 

Using Lemma p^ with 5 = n~^/^(loglogn)^/^, we get 

sup \an{s,t)-an{s',t')\ = 0(n-i/^(logn)i/2(loglogn)i/^), 

|5-s'| + |t-t'|<<5 

almost surely. This implies that / = 0(?7.~^''^(logr2)^/^(loglogn)^/^), almost surely. 
For the second term, we note that 



// = sup \V^[C{¥-s,G-t)-C{s,t)] + Ci{s,t)an{s,l) + C2{s,t)an{l,t)\ 

0<s,t<l 

< Ila + IIb + IIc, 
with 
Ila = sup \V^[C(¥;^s,G-t)-C{s,t)] + Ci{s,t)an{s,l) + C2{s,t)an{l,t)\ 

s,i^{e„,l-e,i) 

lib = sup |Ci(s„,t„)/3„i(s) + Ci(s,t)a„(s, 1)1, 

e„<s,i<l-e,i 

lie = sup |C2«,t'„)/3n2(t) + C2(s,t)a„(l,t)| 

e„<s,i<l— £„ 
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Here £„ — )■ will be specified later, and |s„— s| < ra^-'^/^|/3„i(s)|, |t„— 1| < n^^^'^\/3n2it)\, 
|< -s\< n-i/2|/3„i(s)| and \t'^ - t\ < n-'/'\Mt)l 

Let us bound Ha. Since |Ci| < 1 and IC2I < 1 as copulas are Lipschitz with Lipschitz 
constant 1, we get by the mean value theorem that 

Ila < sup [|/3„i(s)| + |/3„2(t)| + |a„(s,l)| + |a„(l,t)|]. 

s,te[0,e„]U[l-£„,l] 

The Bahadur-Kiefer theorem (Shorack and Wellner 2009, p. 585) states that 
sup \l3ni{s) + q;„(s, 1)1 = 0(n^^'^(logra)^'^(loglogn)^'^) almost surely. 

0<s<l 

Furthermore, 

En^'"^ sup |a„(s, 1)1 < sup s^^''^{l — s)^^''^\an{s,l)\ 

= (logn)^'^ almost surely, 

see Theorem 2 in Mason (1981). (Alternatively, we can apply Lemmajslfor ne„/ log n — )• 
00.) Combining all these bounds entails then 

Ila = 0{el/\\ogny/^ + n-^/\\ogny/\\og\ogny/^) almost surely. 

Next, we bound lib as follows. 

lib = sup |Ci(s„,t„)/3„i(s) + Ci(s,t)a„(s, 1)1 

en<s,t<l-e„ 

< sup \f3ni{s) + an{s,l)\+ sup |Ci (s„, t^) - C'i(s, t) | |a„(s, 1) | . 

e„<s,t<l—e„ en<s,t<l~s„ 

The first term on the right is 0(n^^/'*(logn)^/^(loglogn)^/^) almost surely, as we 
have seen before. For the second term, using condition (CI) we have, for some r > 0, 
/3 > and K < oo, 

sup \Ci{Sn,tn) -Ci{s,t)\ |a„(s,l)| < 

e„<s,t<l-en 



K sup {\sn - sf' + \U - tn (s-^(l - s)-^ + s;^(l - Sn)-^) |a„(s, 1)|. 

e„<s,t<l—E„ 



Since 



s{l+^-^-]>s{l-^-^^^^] >^, 
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and 



F ~ 2 

almost surely and for n sufficiently large, for all £„ — )■ and ne^/logn — )> oo. We 
deduce, using Theorem 2 in Mason (1981), 

sup |Ci(s„,t„) - Ci(s,t)| |a„(s, 1)1 

e„<s,t<l-e„ 



< sup ir(|s„-s|" + |t„-tns2-'^(l-s)2-^(logn 

e„<s,t<l~en 



i-/3(i_s)|-/3(iogn)i/2 
< irn-'-/2max(£r^l)(logn)i/2 ^^p (|/3„i(s)r + |/3„2(t)r 

£„<s,t<l-e„ 

= O (n-"/2(logn)^/2(iQgiQg^)r/2^g^^^^^^ ^^ 



almost surely. We now specify the choice of e„ = n~^ further, with p depending on /3 
and r. If 2/3 > 2r + 1, we take < p < r/(2/3 - 1). Otherwise, we take < p < 1/2. 
For both cases, these choices ensure that lib = 0{n^'^) almost surely, for some g > 0. 
The same bound applies to He. Since L„ = o(logn) by assumption (C2), we obtain 
L„(J + //) — 7- 0, almost surely, as n — )> oo, and the proof is complete. D 

Next, we turn our attention to the bootstrap counterparts. We define a^{s,t) = 
^/n{M.^ — EI„)(s,t) as the ordinary bootstrap empirical process in [0, 1]^. We prove 
the following exponential inequality for the oscillation 

M:(5)= sup K(s,t)-<(s',t')l 

|s-s'|<<5, \t-t'\<5 

of this process. 

Lemma 10. For all bounded sequenees 5„ sueh that n(5„/log(n) — ?> oo as n — )■ oo, 
(33) M;(5„) = 0(5y2(logn)i/2) almost surely. 

Note that the sequence (5„) may be constant. 
Proof. Since a* is a step function, we find that 

sup \a*Xs,t) - al{s\t')\ 

|s-s'|<5„,|t-t'|<5„ 
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equals 

max \al{Xi,Yj) - al{Xi>,Yj>)\. 

Let us rewrite 

n 
k=\ 

as a sum of bounded independent random variables with 

r,,,v/ := ^{xi < X,, f; < r,} - \{xi < x,,, y; < f,,}, 

conditionally on the sample (Xi, Yi), . . . , (X„, y„). Moreover, a simple calculation 
and Lemma |8] yield 

Var*(r,,,,y) < P*{^i<^fc<^.'} + P*{>^.<n*<>S'} 

< sup[F,(s + bn) - F„(s)] + sup[G„(t + bn) - G„(t)] 

s t 

< 2bn + 2n-^^^Mn{bn) 

< Kmax{bn,Mn{bn)/Vn) 

< Kmax{bn,^/bn^ogn/^/n) 

= Kbn 

for n large enough, for almost all realizations (Xj, Yj) and for some constant K > 
0. Hence, by the union bound and Bernstein's exponential inequality for bounded 
random variables, we have, for some constant Ki, 

|<(x„y,)-<(x,,,F,oi>4 

< 2n^ exp {-Kl{^/^x A x'^b~^)) , 



P* <! max 

^,^',j,j'e{l,■■■,n},\X^-X^,\<S„, \Y,-Y^, 



for all samples (Xi, Yi), . . . , (X„, F„). By integrating the previous inequality over P, 
we get the same inequality, but replacing P* by P. Set x = K^bJ (logn)^/^ and take 
the constant Kq is sufficiently large, to obtain 

oo 

5^P{M:(5„) > K,bl!\\ognYl^] < +00. 

n=l 

Apply the Borel-Cantelli lemma to conclude the proof. D 
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Analogous to the approximation of the process Z„ by Z^ before, we introduce a 
simpler process Z* to approximate Z*. Set 

(34) z;(s,t) = v^(e;-e„)(s,t)-Ci(s,t)v^(F;-F„)(s)-C2(s,t)v^(G;-G„)(t), 

and 

4L„ 



KW = J2'"'^n{Sk,h). 



fc=l 



Proposition 11. Under conditions (CI) and (C2), we have 



lim E 

n— ^-oo 



Proof. First, we notice that, for any (5 > 0, 



0. 



(35) 



E 



dsL^'Ki^'K: 



< (5 + 2E 



P* { sup 



Kif) - Kif) 



> 6 



Hence it suffices to show that 



(36) 



sup 



Kif)-Kif) 



^0 



in P*-probability, for all realizations of (Xi, Yi), . . . , (X„, y„) in an event Qn with 



P{n„} — )■ 1. We now verify (36). Some straightforward adding and subtracting 
yields 



with 



z;(s,t) = Zl{s,t) + R:{s,t) 



K{s,t) = v^{M:(s,t) -H„(s,t)} + V^{C{¥nS,Gnt)-C{¥*^s,G*J)} 



and 

Kis,t) 



n {C{¥l-s, Gl-t) - C{s, t) } - V^{C{s, t) - ^(F:^, G^t)} + 
-v^{C(F„s,G„t) - Cis,t)} - v^{C(F;s,G;t) -C(s,t)} 
+ {a:(F-s,G;t)-a:(s,t)} 

+ {«„(F:-s,Grt) - «„,(F;.,G;t)} + 

+ {a:(¥:-s,G:-t) - <(F;s,Grt)} + {a:(¥-s,G:-t) - a:(¥-s,G-t)} 
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Next, we observe that 



") 



dBLAK,K) < 4L„sup|Z:(s,t)-Z:(s,t)| 

s,t 

< 4L„sup|Z:(s,t)-Z;(s,t)|+4L„sup|K(s,t)|. 

s,t s.t 

Now, use the same techniques as in the proof of Proposition |9j adapted to boot- 
strapped empirical processes. To be specific, with the aid of the following approxi- 
mation results: 

sup K(s,l)|/(s^/2(l - s)i/2) = Op.(l) almost surely, 

0<s<l 

see Theorem 2.1, display (2.10) and Remark 2 at page 1457 in Csorgo and Mason 
(1989), 

sup [|ttn(s, 1)1 + |Q;*(l,t)|] = 0{el/'^{\ogny''^) almost surely, 

s,te[0,e„]U[l-e„,l] 



see Lemma 10 above 



sup \(3*^iis) + al{s, 1)1 = 0{n-^/^{\ogny/^{\oglognY/^) almost surely, 

s 

see displays (2.10') and (2.12') in Theorem 2.1 of Csorgo and Mason (1989), we can 
show that 4L„sup^^ |Z* (s,t) — Z*(s, t)| +4L„ sup^ j |i?*(s, t)| = Op. (L„n~'^) for some 
^ > 0, in probability, proving the result. 
For instance, to deal with i?*, note that 

\a:{¥-s,G-t)-a:{s,t)\ < M;(5), 



with 5 = 0{n ^/^(loglogn)^/^) almost surely. Consequently, by Lemma 10 this term 



is of order Op* {n "), for < a < 1/4. Another term in i?* can be handled as follows: 

|a„(F:-s,Grt)-a„(F;s,G;t)| < M„,(5) 

with 5 = Op*{n^^^'^), so that the term on the left is of order O p* (n^ ^ ^'^ {log nY^"^) by 
Lemma [8] The other terms can be treated in the same way. D 



sup |E[/i(z„)]-E*[/i(z;; 

heBLi 



< sup 

heBLi 



+ sup 

heBLi 



+ sup 

heBLi 



E[/i(Z„)]-E*[/i(Z:)] 
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5.2. Proof of Theorem [I| We need to show that cIbli (^n, Z^) -^ in probabihty, 
as n — 7- oo. By triangle inequahty, we have, with probabihty one, 

dBLii'^n, ^n, 

h&BLi 

E[/i(Z„) - /i(Z„)] 

¥:[h{K)-h{zi)] 

Since Proposition [9J shows that dBhA'^ni^n) — )■ as n — > oo and Proposition |ll| 
imphes that dBL^'^ni'^n) ~^ 0) i^ probabihty, as n — )■ oo, it remains to show that 
dBLii'^n, ^*n) -^ 0, in probabihty, as n — > oo. We recaU that 

4L. 



^(^k fk{x,y)dan{x,y), 

k=l -^ 



for 



fkix,y) = l{x <Sk,y< tk} -Ci{sk,tk)l{x < s^} - C2{sk,tk)l{y < h}. 
Now, let hf{x,y) = Y.tt{(^kfk{x,y) so that 

(37) ZM= [hfdan, 
and we can derive in the same way 

(38) Kif)= [hjda:. 



We now apply Theorem 3 in Radulovic (2012), stated as Theorem 12 in the appendix 
for convenience. We need to verify that 
• the three classes 

Gl = {l{x<Sk,y<tk}, 0<Sk,tk<l}, 
G\ = {Ci{sk,tk)l{x< Sk}, 0< Sk,tk<l}, 

Ql = {C^2(Sfc,4)l{l/<tfc}, 0<Sfc,tfc<l}, 
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have VC-indices 14", V^ and V^, respectively, with EtSl^fc" + ^k + ^k) < 
Kilogn)"' for some finite constant K and some < 7 < 1. 
• the class "H^ = {hf : / G J-'„} has an envelope H{x,y) with E[iJ'^(X, F)] < 
00. 

First we verify the VC property. The class Q^ is VC with VC-dimension V^ = 3 
(Van der Vaart and Wellner (1996, page 135)), while the class Q/^ is a subclass of 
the class of functions dl{a < x < b} with a,b E M. and d > 0. This class has a VC 
index 3 : see van der Vaart and Wellner (1996, Problem 20, page 153). The same 
reasoning applies for the class Q^., and consequently 

^(\4« + V,' + V^) < 36L„ < K{\ognr 

k=l 

for some i^ < 00. It remains to verify the envelope condition. Writing 

g^,y{s,t) = l{x<s,y<t}-Ci{s,t)l{x<s}-C2{s,t)l{y<t} 



we see that 



hfix,y) = ^Ckgx,yiBk 



k=l 



for Ck = ±1 and Bk = [si, si] x [tl,tl]. Here {sL^L ■^L^i}fe=i ^^^ the appropriately 
relabeled {sk, ^A:}t=i ^^^ the operation (j){Bk), for any function : M^ — )> M, is defined 

as 



{4A)-<Pi4A)-<Pi4,tl) + <Pi4,tl). 



Furthermore, writing 



Ix,y{s,t) = l{x < s,y <t}, 
IL^^y{s,t) = Ci{s,t)l{x <s}, 
IIh,y{s,t) = C2{s,t)l{y<t}, 
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we have 



\hf{x,y)\ 



7 ^Ckgx,y{Bk 



k=l 



< Y, U^ABk)\ + Y, \^^-ABk)\ + Yl \^^^-ABk 



k=l 



k=l 



k=l 



Trivially, since rectangles are disjoint, the first term ^^.^i |-^i',?;(-Bfe)| is bounded by 
1. The estimate for the two remaining terms on the right are identical (just use C2 
instead of Ci), so we bound J2k=i \^^x,y{Bk)\ only. We first observe that, after a 
little algebra, 

II,,y{B,) = [C,{sltl)-C,{sltl)]l{sl<x<sl} + l{x<sl}C,{B,). 

This identity yields 

Ln Lin L 



fe=i 



fc=i 



k=l 



Ln 



< 2 5^1{4<x<4} + 5^|Ci(i?, 



fc=i 



fc=i 



< 2 + TV{Ci). 



The last inequality follows from ||Ci||oo < 1 ( since copulas are Lipschitz with Lips- 
chitz constant 1) and our assumption (CI) that TV{Ci), the total variation of Ci, 
is finite. Consequently, the class T-Ln has envelope 5 + TV{Ci) + TV{C2)- 



We can now apply Theorem 12 to conclude that dsLii'^ny'^n) ~^ 0; i^ probabihty, 

as n —7- 00. D 



5.3. Proof of Theorem [3} We proceed as in the proof of Theorem [TJ We write 
C = Cn and C* = Cn, . Recall that 



Y„ = Z„-v^(C-C). 
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We may replace Z„ by Z„ with impunity since 

dBLA^nX-V^{C-C)) 



< (5 + 2P<^ sup 

= (5 + 2pJ sup 
— > (5 as ra — )■ oo, 



¥„(/)- z„(/) + v^(a-c)(/) 

Zn(/)-Z„(/) >6 



> 6 



for every 5 > 0, as in the proof of Proposition |9j Next, by the mean value theorem 
and assumptions (C3) and (C4), we have 

^^((?-C)(s,t) = 



for some 6 between 6 and ^o 

ijdan + n^/O Ceo{s,t) + ^{6- 6,)' {C-,{s,t) - Ce,{s,t)} 



ipdanj Ceois,t) + Rn{s,t) 
for some remainder term Rn that satisfies 

\Rnis,t)\ < n'/^\\en\\\\Ce,is,t)\\+Kn^/^\\e-eot 
= 0,(nV2||5„||+n-)/2) 
= Op(l/L„). 

This bound holds uniformly in s and t. Consequently, for 



4L 



Yn(/)= J]^fcY„(Sfc,t, 



k=l 



based on 



we have 



Y„(s,t) = Z^is,t)-{ ijdan] Ce,{s,t) 



dBLA^n-MC-C),Y^) = rfBLi(Y„-i?„,Y„). 
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Since 

SUp|i?„(/)| < Ln sup \Rn{s,t)\ -^ 



in probability, we get dshA'^n — Vn{C — C), Y„) — )■ 0, as n — )■ oo. We conclude that 

limsuprfijLj(Y„,Y„) = 0. 
For the bootstrap counterpart, we can argue in the same way. Using the expansion 

for some remainder term i?* that satisfies 

sup |K(s,t)| < K^n^l'^WelW + KYa^^''\\d - Q^f^' + K2n^l''\\d* - ^f +^ 

for some finite constants Kq^ Ki and i^2- We check that the processes Y* and Y* are 
close, in the sense that 

in probability, as n — ?■ oo. Here, Y* is based on 

Note that Y„(/) = Y.k^k^nisk,tk) = J {Y.k^k9k) dan with 

gk{x,y) = l{x < Sk,y <tk} -Ci{sk,tk)l{x < Sk} -C2{sk,tk)l{y <tk} 
- i^ix,y))'Ceoisk,tk). 
As in the proof of Theorem [T| it remains to verify the two conditions of Theorem 



12 Since the only difference with the proof of Theorem [T] is the addition of the term 



('?/'(x,y)) Ceg{sk,tk), we concentrate on the class of functions {ip{x,y)y CoQ^Skytk). 
Since it is a subclass of c'ip{x, y) with c G M^, its VC dimension trivially is equal to 
p. Moreover, it is not hard to see from the proof of Theorem [T] that 



4L„ 



J2^k9k{x,y)\ < 5 + TV{C,) + TV{C2) + mx,y)\\TViCe,), 



k=l 



Since E[||'?/'(X, y)||^] < oo, the conditions of Theorem 12 are met, and we conclude 
that dsLiO^ny^n) — ?• lu probability, as n — )■ oo. D 
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5.4. Proof of Proposition [5} From the proofs of Proposition |9] and Proposition 



11, we see that 



sup |Z„(M,t>) — Z„('u, f )| = Op (n ^) , and 

(«,u)6[0,l]2 

sup \Zl{u,v) -Ki^^,v)\ = Op, {n-^^) 
almost surely, for some /i > 0. The result follows after integration by parts. D 



5.5. Proof of Corollary [6| By the delta-method, {Y„(s,t),0 < s,t < 1} con- 
verges towards a Gaussian process in £°^([0,1]^). The proof of Theorem p] shows 
that limsup„_^o^ (iBLi(Y„, Y„) = 0. Hence, the process Y„ converges weakly to the 
same limit as Y„. This proves the first claim. The second part of the Corollary is a 
straightforward consequence of Theorem [3] and the triangle inequality applied to the 
distance (Ibli- D 

Appendix A. 

Let Xi, . . . ,Xn be independent random variables with probability measure P. Let 
P„ be the empirical probability measure, putting mass 1/n at each observation, and 
let P* be the nonparametric bootstrap measure based on n independent observations 
from P„. We index the empirical process -\/n(P„ — P) and its bootstrap counterpart 
ri(P* — P„) by functions / that belong to a sequence of classes J-'„. 



Theorem 12. Let dn be an integer sequence and, for each I < i < dn, let Qi^n be a 
VC class of functions with VC index Vi^n CLnd 

Y,Vi.n<K{\ogn)\ 

i=l 

for some K < oo and < 7 < 1. Set 

{d„ 
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and suppose that there exists an envelope function F > sup.gjr |/|, independent of 
n, with E[F^(X)] < oo. Then, 

dBL, ({v^(Pn - P){f), feJ^n}, {V^iK " ^n){f), f G ^J) ^ 0, 

in probability, as n -^ oo. 

Proof. See Theorem 3 in Radulovic (2012). D 

Appendix B. 



Set M„(5) as in (31) for 5 >0, and define 

ij{x) = 2x~^{(l + x) log(l + x) -x}, X e (-1, 0) U (0, cx)) 
and ^(—1) = 2 and V'(O) = 1- This function is continuous and decreasing. 

Proposition 13. There exist constants Ki and K2 such that 
(39) P{M„(a) > A}< — expj-^^^^ ^ 



a \ a \ \/na 

for all a G (0, 1/2] and all A G [0, 00). 

Proof. See Proposition A.l of Segers (2012). Segers credits Einmahl (1987) who 
proved this bound for the independence copula C{u, v) = uv and Tsukahara (private 
communication) for observing that the inequahty holds for any copula. D 
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